Statistics of Lead Changes in Popularity-Driven Systems 
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We study statistical properties of the highest degree, or most popular, nodes in growing net- 
works. We show that the number of lead changes increases logarithmically with network size A'^, 
independent of the details of the growth mechanism. The probability that the first node retains the 
lead approaches a finite constant for popularity-driven growth, and decays as N'"^ {\nN)~^^'^ , with 
(j) = 0.08607 . . ., for growth with no popularity bias. 
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Extremes are vitally important in science and engi- 
neering. These quantities are used to determine the like- 
lihood of a rare event, such as the probability of failure 
of a space shuttle launch or of a dam in flood conditions. 
The theory of extreme statistics provides a powerful tool 
to understand such real- world situations jl]. Extremes 
are also irresistible in everyday life - we are naturally 
drawn to compilations of various pinnacles of human en- 
deavor, such as, for example, lists of the most beautiful 
people, the richest people, the most-cited scientists, ath- 
letic records, etc 

This social perspective about extremes raises new 
questions for which much less is known compared to the 
magnitude of the extreme value itself [||. For example, 
how does the identity of the leader - the individual who 
possesses the extreme value of a particular attribute - 
change as a function of time? What is the rate at which 
lead changes occur? What is the probability that a leader 
retains the lead as a function of time? 

We address these questions within the framework of 
growing networks, where the relevant quantity is the node 
degree - the number of links that join to each node. We 
view the degree as quantifying the popularity (or wealth) 
of the node, and the leader is the node with the highest 
degree. We focus on generic network models with prefer- 
ential attachment to already popular nodes |@, |[ ^, 0, D , 
and networks with random attachment. The former de- 
scribe, for example, the distributions of biological gen- 
era, word frequencies, publications, urban populations, 
and income ||] , and contemporary applications to col- 
laboration networks and the World-Wide Web have been 
developed |^ . It has been posited that a hallmark of such 
systems is "the rich get richer" - that is, more popular 
nodes tend to remain so ^ . Our basic goal is examine 
the consequences of preferential and random attachment 
mechanisms in growing networks and to test whether the 
adage of the rich get richer really does apply. 

The network grows by adding nodes, each of which 
links to a pre-existing node with an attachment rate Ak 
that depends only on the degree k of the target node. We 



choose Afe = /c-f A with A > — 1 jH ^. For such networks, 
the degree distribution has an asymptotic power-law tail, 
Nk ~ N /kr'^^ , where Nk is the number of nodes of degree 
k and TV is the total number of nodes ^. For A ^ oo 
the growth mechanism reduces to random attachment 
and the degree distribution is exponential. 



10" 



o A,=l 

AA,=k 



& 10' 



O 

O A A ^ " 



10 10 10 10 10 10 



'Electronic address: 
1 Electronic address: 



3aulk@bu.cdu 



•ednerObu.edr 



N 



FIG. 1: Average index of the leader J\eiid{N) as a function of 
the total number of nodes A'' for 10^ realizations of a growing 
network of 10^ nodes. Shown are the cases of attachment 
rates Ak — 1 and At — k. 

Identity of the leader. We characterize the identity of 
each node by its index J. A node of index J is the J'^ 
one introduced into the network. To start with an unam- 
biguous leader, the initial system contains N = 3 nodes, 
with the initial leader having degree 2 (and index 1) and 
the other two nodes having degree 1 . A new leader arises 
when its degree exceeds that of the current leader. For 
the linear attachment rate, Ak — k, the average index 
of the leader J\ead{N) saturates to a finite value of ap- 
proximately 3.4 as iV ^ CXI (Fig. |^). With probability 
« 0.9, the leader is from among the 10 earliest nodes, 
while the probability that the leader is not among the 30 
earliest nodes is less than 0.0 1. Thus only the very earli- 
est nodes have appreciable probabilities to be the leader; 
the rich really do get richer. Similarly in the general case 
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of Ak = k + X, the average index of the leader also sat- 
urates to a finite value that is a continuously increasing 
function of A. 

For random attachment {A^ — 1), the average index 
of the leader grows algebraically, Jicad(-^) ~ with 
-0 « 0.41 (Fig. p. The leader is still an early node (since 
^p < 1), but not necessarily one of the earliest. For exam- 
ple, simulation indicates that for N = 10^ a node with 
index greater than 100 has a probability of approximately 
10^^ of being the leader. Thus, in random attachment, 
the order of node creation plays a significant but not de- 
terministic role in the identity of the leader node. 

The identity of the leader can be determined from the 
joint index-degree distribution. Let Ck{J,N) be the av- 
erage number of nodes of index J and degree k. As shown 
in W, for constant attachment rate, this joint distribu- 
tion has the Poisson form. 
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From this, the average index of a node of degree k is 
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implying Jiead(^) = 7V(2/3)''»'»=. We estimate the 
maximum degree from the extreme value criterion 
Efe>fc_ Nk{N) « 1. Using Nk{N) = §, we find 

2fcmax ^ ]sf^ or fcmax lniV/ln2. Therefore 

Jiead(A^) oc iV^, with = 2- 0.415 037, 

m 2 

in excellent agreement with our numerical results. 

For the linear attachment rate, the joint index-degree 
distribution is Q 



fe-1 



(3) 



from which the average index of a node of degree k is 
Jk{N) = l2N/[{k + i){k + A)l Since Nk{N) - AN/k^ for 
the linear attachment rate |6[ |7|, the extreme statistics 
criterion X]fc>fc„„^ Nk{N) « 1 gives fcmax ~ Vn. There- 
fore Jiead(.^) ^ 12-/V/fcniax ~ ^{^) indeed saturates to 
a finite value. A similar result holds in the general case 
Ak — k + \. Thus the leader is one of the first few nodes 
in the network. 

Lead changes. We find that the average number of lead 
changes L{N) grows logarithmically in N for both the 
attachment rates Ak = 1 and Ak = k (Fig. ||). There is, 
however, a significant difference in the distribution of the 
number of lead changes, P{L), at fixed N . For Ak — 1, 
this distribution is sharply localized, with the average 
value L w 5.609 in a network of TV = 10^ nodes, while 
the maximum number of lead changes in 10^ realizations 
was 16. On the other hand, for A^ = k, P{L) has a 



significant tail and the maximum number of lead changes 
is 63. This longer tail in P{L) for linear attachment 
stems from repeated lead changes among the two leading 
nodes. Even though the distribution is visually broader, 
the average number of lead changes, L m 5.096, is less 
than that for Ak = 1- Related to lead changes is the 
number of distinct nodes that enjoy the lead over the 
history of the network. Simulations indicate that this 
quantity also grows logarithmically in iV. 




FIG. 2: Average number of lead clianges L{N) as a function of 
network size A'^ for 10^ realizations of the network for Ak = 1 
and Ak = k. 

This logarithmic behavior can be easily understood 
for the attachment rate Ak — 1- Here the number of 
lead changes cannot exceed the upper bound given by 
the maximal degree fcmax ~ In A^/ In 2. To establish the 
logarithmic growth in the general case we first note that 
when a new node is added, the lead changes if the leader- 
ship is currently shared between two (or more) nodes and 
the new node attaches to a co-leader. The number of co- 
leader nodes (with degree fc — fcmax) is N/k^:^^, while the 
probability of attaching to a co-leader is fcmax/A^- Thus 
the average number of lead changes satisfies 



^ T ^ AT\ ^max N 

L{N) oc^^. 



(4) 



Since the maximal degree fcmax grows as N^^ (2+^) ^ Eq. (^) 
reduces to dL{N)/dN (x N^^ and thus gives the log- 
arithmic growth L{N) (X InN. This argument can be 
adapted to networks with arbitrary attachment rates (ex- 
cept those growing faster than linearly with fc [Q), and 
thus the growth law L{N) cx In A^ is universal. This uni- 
versality is reminiscent of the radius of random networks 
which typically are proportional to In A^, independent of 
their construction mechanism. 

Fate of the first leader. Figure |^ shows that the degree 
distribution of the first node depends on the initial con- 
ditions for the linear attachment rate; the same is true in 
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the general case Ak = k + X while for Ak = I the initial 
condition is asymptotically irrelevant. 

We can determine the degree distribution of the first 
node analytically for the constant and linear attachment 
rates. (A similar approach is given in Ref. |^). Let 
P{k, N) be the probability that the first node has de- 
gree k in a network of N links [0. For Ak = k, this 
probability obeys the master equation 



P{k,N+l) 



1 



2N 



P{k-1,N) + 



2N ~k 
2N 



Pik,N). (5) 



The first term on the right accounts for the situation 
when the earliest node has degree fc — 1. Then a new 
node attaches to it with probability (fc — 1)/2N, thereby 
increasing the probability for the node to have degree k. 
Conversely, with probability {2N — k)/2N a new node 
does not attach to the earliest node, thereby giving the 
second contribution to P{k, N + 1). 
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FIG. 3: Degree distribution of the first node for the dimer, 
trimer (2 links to the initial node), and pentamer (4 links) 
initial conditions based on 10^ realizations of a network with 
10^ links. The curve is the prediction of Eq. (^). 

The solution to the master equation (||) for the "dimer" 
initial condition o — o is [0 



P{k,N) 



1 



{2N -k- 1)! 



(6) 



22N-k-i [N -k)\{N 
For — !■ oo, this simplifies to the Gaussian distribution 
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P{k,N) 
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for finite values of the scaling variable k/N^/"^ . Thus the 
typical degree of the first node is of the order oiN^/"^; this 
is the same scaling behavior as the degree of the leader 
node. For the trimer initial condition (which we typically 
used in simulations) we obtained the degree distribution 
of the first node in the form of a series of ratios of gamma 



functions ||ri|, in which P{k, N) has an Gaussian 
tail, independent of the initial condition. The degree of 
the first node also approximates that of the leader node 
Q more and more closely as the degree of the first node 
in the initial state is increased. 

Although A^) contains all information about the 
degree of the first node, the behavior of its moments 
{k°-)N — ^k'^PikjN) is simpler to appreciate. To de- 
termine the moments, it is more convenient to construct 
their governing recursion relations directly, rather than 
to calculate the moments from P{k,N). Using Eq. (||), 
the average degree satisfies the recursion relation 



{k)N+i - (fc)jv ( 1+ 



whose solution is 



(k) 



N 



A- 



r (N 



A 



r 



r(A) 



A^i/2, 



(8) 



(9) 



The prefactor A depends on the initial condition, with 
A = 2, 8/3, 16/5, . . . for the dimer, trimer, tetramer, etc., 
initial conditions. 

This multiplicative dependence on the initial condition 
means that the first few growth steps substantially affect 
the average degree of the first node. For example, for the 
dimer initial condition, the avera ge de gree of the first 
node is, asymptotically, {k)^ 2-\/A^/vr. However, if the 
second link attaches to the first node, an effective trimer 
initial condition arises and (fc)Ar - (8/3) VAA- Thus 
small initial perturbations lead to huge differences in the 
degree of the first node. 

An intriguing manifestation of the rich get richer phe- 
nomenon is the behavior of the survival probability S{N) 
that the first node leads throughout the growth up to size 
A^ (Fig. For the linear attachment rate, S{N) satu- 
rates to a finite non-zero value of approximately 0.277 as 
A^ ^ oo; saturation also occurs for the general attach- 
ment rate Ak = k + X. Thus for these popularity-driven 
systems, the rich get richer holds in a strong form - the 
lead never changes with a positive probability. 

For constant attachment rate, ^(A^) decays to zero as 
N ~f oo, but asymptotic behavior is not apparent even 
when A^ = 10®. A power law S'(A^) cx A^""* is a rea- 
sonable fit, but the local exponent is still slowly decreas- 
ing at N K 10® where it has reached <j){N) « 0.18. To 
understand the slow approach to asymptotic behavior, 
we study the degree distribution of the first node. This 
quantity satisfies the recursion relation 
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P{k,N) = — P(fc-1,A-1) + 



A- 1 



P(fc,A^-l) (10) 



A ' 'A 
which reduces to the convection-diffusion equation 

d d\ 1 d^P 
dh^N^dk) ^2~dk^ 



in the continuum limit. The solution is a Gaussian 
1 r (fc-lnA^)2 1 



P(fc, A) 



V27r In A 



exp 



21nA^ 



(11) 
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Therefore the degree of the first node grows as In N, 
with fluctuations of the order of Vln A^. On the other 
hand, the maximal degree grows faster, as i; In with 
V = l/hi2, and its fluctuations are neghgible. 




FIG. 4; Probability that the first node leads throughout the 
evolution for 10^ realizations of up to size A'^ — 10^ for Ak = k 
(upper), and up to A*' = 10* for = 1 (lower). 

We now estimate the large-iV behavior of S{N) as 
J2k>k P{k,N). This approximation gives 

S(N) cx / exp<^- ^ ^/ } 

JyinNVh^N "^l 21n7V / 

cx iV""* {ItiN)-^/'\ (13) 

with = (w - 1) V2 « 0.097989 . . .. The logarithmic fac- 



tor leads to a very slow approach to asymptotic behavior. 

The above estimate is based on the Gaussian approxi- 
mate for P(fc, N) which is not accurate outside the scal- 
ing region, namely, for k ^ InN + VhT/V. However, we 
can determine P(k, N) exactly because its deflning re- 
cursion formula, Eq. ( |lC| ) , is closely related to that of the 
Stirling numbers of the first kind [^] jl^ , and the solu- 
tion for the dimer initial condition is P{k, N) — [^] /Nl. 
The corresponding generating function is 

fe = l 

Using the Cauchy theorem, we express P{k, N) in terms 
of the contour integral Sn{x)/x''~^^ . When N oo, 
this contour integral is easily computed by applying the 
saddle point technique [jll] . Finally we arrive at Eq. ( p^ ) 
with the same logarithmic prefactor but with a slightly 
smaller exact exponent = 1 — v + vlnv ^ 0.08607. 

In summary, lead changes are rare in popularity-driven 
network growth processes and leadership is restricted to 
the earliest nodes. With finite probability, the first node 
remains the leader throughout the evolution. For growth 
with no popularity bias, leadership is shared among a 
somewhat larger cadre of nodes. As a consequence the 
average index of the leader node grows as N"^ with — 
0.415 037 . . .. The possibility of sharing the lead among a 
larger subset of nodes gives a rich dynamics in which the 
probability that the first node retains the lead decays as 
N^'l' (In 7V)^i/2 with = 0.08067 . . .. 
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